Research on Phoneme Sequences for Language Identification and Concurrent Voice Transmission

Author

Pratima Sharma

Abstract

Language identification is the process of identifying the language being spoken from a speech sample by an unknown speaker. Most previous work in this field rests on the fact that phoneme sequences have different occurrence probabilities in different languages, and the systems designed so far have tried to exploit this fact. The language identification process consists of two sub-systems. The first converts speech into an intermediate form, phoneme sequences; the second models the language through probabilistic analysis of those sequences. This project targets both sub-systems. First, some algorithms for designing language models are discussed. Then an attempt is made to design an algorithm that extracts phoneme sequences in the form of more abstract classes derived with statistical tools such as Gaussian Mixture Models (GMM) and Hidden Markov Models (HMM).
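As a rough illustration of the second sub-system, the idea that phoneme sequences have different occurrence probabilities in different languages can be sketched with one bigram model per language. This is a minimal sketch and not the method of the paper: the add-one smoothing, the function names (`train_bigram_model`, `score`, `identify`), the language labels and the toy phoneme data are all assumptions made for illustration.

```python
import math
from collections import Counter


def train_bigram_model(sequences, vocab):
    """Return a log-probability function for add-one smoothed phoneme bigrams."""
    bigrams = Counter()
    contexts = Counter()
    for seq in sequences:
        padded = ["<s>"] + list(seq)  # "<s>" marks the start of a sequence
        for prev, cur in zip(padded, padded[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    vocab_size = len(vocab)

    def log_prob(prev, cur):
        # Add-one smoothing keeps unseen bigrams from getting zero probability.
        return math.log((bigrams[(prev, cur)] + 1) / (contexts[prev] + vocab_size))

    return log_prob


def score(log_prob, seq):
    """Log-likelihood of a phoneme sequence under one language model."""
    padded = ["<s>"] + list(seq)
    return sum(log_prob(prev, cur) for prev, cur in zip(padded, padded[1:]))


def identify(models, seq):
    """Pick the language whose model assigns the highest log-likelihood."""
    return max(models, key=lambda lang: score(models[lang], seq))


# Hypothetical toy data: invented phoneme strings for two made-up languages.
training = {
    "lang_a": [["k", "a", "t", "a"], ["t", "a", "k", "a"], ["k", "a", "k", "a"]],
    "lang_b": [["s", "t", "r", "i"], ["s", "t", "i", "r"], ["t", "r", "i", "s"]],
}
vocab = sorted({p for seqs in training.values() for s in seqs for p in s})
models = {lang: train_bigram_model(seqs, vocab) for lang, seqs in training.items()}

print(identify(models, ["t", "a", "k", "a"]))
print(identify(models, ["s", "t", "r", "i"]))
```

In a real system the phoneme sequences would come from the first sub-system (a phoneme recognizer), and longer n-grams or other smoothing schemes could replace the bigram model used here.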


Similar resources

Language Identification and Locale Based Web Browser: New Approach

“Voice recognition has come a long way in the last few years,” as said plain and simple by Judge Thomas C. Smith, author of Dictating to your Computer (for judges and lawyers) (1). What was once a sceptical and highly inaccurate, almost useless device, voice recognition is now an incredibly helpful and advanced tool. Voice recognition is defined by www.techtarget.com as simply, “the ability of ...


Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems

This article examines methods for optimizing the performance of GMM-based voice conversion systems, in which the GMM method is introduced as the basic method for improving voice conversion performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality ...


Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...


Automatic Speaker and Language Identification

This thesis deals with the problem of automatic language identification (LID) and automatic speaker identification (SID) given the speech signal as input. Both research areas have received renewed interest due to heightened homeland security awareness, e.g. in the use of a speaker’s voice print for biometric identification, language identification for the classification of speech archives and call-sc...


Pump-priming PASCAL proposal: Large Margin Algorithms and Kernel Methods for Speech Applications

Research on large margin algorithms in conjunction with kernel methods has been both exciting and successful. While there have been quite a few preliminary successes in applying kernel methods to speech applications, most of the research efforts have focused on non-temporal problems such as text classification and optical character recognition (OCR). We propose to design, analyze, and implement...



Publication date: 2014
